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INTEGRATED DIGITAL SIGNAL 
PROCESSOR/GENERAL PURPOSE CPU 
WITH SHARED INTERNAL MEMORY 

This is a continuation of application Sen No. 08/011,102 5 
filed on Jan. 29, 1993, now abandoned, which is a continu- 
ation of application Sen No. 07/467,148 filed on Jan. 18, 
1990 now abandoned. 

BACKGROUND OF THE INVENTION 10 

1. Field of the Invention 

The present invention relates generally to data processing 
systems and, in particular, to a processing platform that 
provides integrated general purpose and digital signal pro- 15 
cessing (DSP) capabilities for recovering and processing 
digital data utilizing an internal shared memory resource. 

2. Discussion of the Prior Art 

The basic function of any communications system is to 
transmit information over a communication channel from an 20 
information source to a destination as fast and as accurately 
as possible. 

There are two general types of information sources. 
Analog sources, such as a telephone microphone, generate a 
continuous signal Digital sources, such as a digital data 25 
processing system, generate a signal that consists of a 
sequence of pulses. 

Communications channels that are designed to transmit 
analog signals (e.g., the telephone network) have character- 
istics which make it difficult for them to transmit digital 30 
signals. To permit the transmission of digital pulse streams 
over an analog channel, it is necessary to utilize the digital 
data pulses to modulate a carrier waveform that is compat- 
ible with the analog transmission channel. ^ 

The equipment that performs the required modulation is 
generally referred to as a "MODEM". The term "MODEM" 
is an acronym for MOdulator-DEModulator, since one piece 
of equipment typically includes the capability not only to 
modulate transmitted signals, but also to demodulate ^ 
received signals to recover the digital data from the modu- 
lated analog carrier waveform. 

While passing through the transmission channel, the 
modulated carrier waveform suffers from distortion intro- 
duced both by the system itself and by noise contamination. 45 
Thus, one of the tasks of the modem's demodulation func- 
tion is to filter the signal received from the transmission 
channel to improve the signal-to-noise ratio. The demodu- 
lator also retrieves timing information from the received 
signal to provide sampling points for recovering the digital 53 
data. The demodulator may also condition the data in other 
ways to make it suitable for additional processing. 

In a conventional modem, the signal filtering, sampling 
and conditioning tasks are performed by three functional 
units: analog-to-digital conversion circuitry ("analog front 55 
end**) that converts the received modulated carrier waveform 
to a Higiriyivi replica, a digital signal processor (DSP) that 
recovers the digital data from the digitized replica, and a 
control function for controlling both the analog front end 
and the digital signal processor. The digital signal processor 60 
recovers the data by implementing a signal conditioning and 
data recovery algorithm that is specific to the type of data 
being received. 

For example, the digital signal processor function in a 
facsimile (fax) ma chin e modem implements a special pur- 65 
pose algorithm that can only be used for recovering digital 
fax data. In the case of a fax system, the data to be recovered 



is a digital fait map that corresponds to the transmitted hard 
copy image and which has been compressed to facilitate 
efficient transmission. The algorithm implemented by the 
digital signal processor function of the receiving fax 

5 machine* s modem is a dedicated "fax** algorithm that has 
been designed specifically for accurately recovering the 
compressed bit map. It cannot recover digital data in a 
format other than a compressed bit map, e.g. voice mail data 
or data modem applications. A different digital signal pro- 

10 cesser implementing a different dedica t ed **voice mail** or 
"data modem** algorithm is needed for each of these other 
applications. 

As shown in FIG. 1, a conventional fax machine archi- 
tecture may be partitioned into two major functional blocks: 

is ( 1) a special purpose fax modem block of the type described 
above for recovering a compressed bit map from a modu- 
lated carrier waveform and (2) a general purpose processor 
block for performing those tasks require to convert the 
compressed bit map to a corresponding hard copy image. 

20 A well known example of a special purpose fax modem 
block is the Rockwell R96DFX MONOFAX® modem chip, 
the so-called "Rockwell Module**. In the Rockwell Module, 
the incoming modulated carrier waveform received from an 
analog channel, ie., a telephone line, is processed by an 

25 analog front end which generates a digitized replica of the 
analog signal; that is, the analog front end generates a digital 
reading of the input voltage level. A dedica t ed fax digital 
signal processor then performs the adaptive filtering, signal 
sampling, synchronization and carrier phase/frequency 

30 tracking required to reconstruct the compressed facsimile bit 
■nap from the digitized replica provided by the analog front 
end. The recovered bit map is then provided to the general 
purpose processor block which performs the additional 
processing functions required for printing the transmitted 

33 image. That is, the general purpose processor block controls 
and performs the data decompression, decoding, imaging 
and printing functions necessary to generate a hard copy 
reconstruction of the recovered bit map. 

To transmit an image, the fax machine shown in FIG. 1 
performs the above-described steps in reverse order. The 
general purpose processor block controls and performs the 
conversion of the hard copy image to a corresponding 
compressed bit map. The compressed bit map is then pro- 
vided to the special purpose fax modem block which utilizes 

43 the bit map to modulate a carrier waveform which is 
transmitted over the analog channel to a destination fax 
machine. 

A modem architecture similar to that of the Rockwell 

50 Module is also provided by the Yamaha YM7109 FAX 
modem LSI chip. 

The fax machine architecture exemplified by the Rock- 
well Module and the Yamaha modem chip, mat is, a special 
purpose fax modem block in combination with a separate 

55 general purpose processor block, suffers from a number of 
disadvantages. First, the system requires two separate pro- 
cessor functions: the special purpose DSP function of the 
modem block for recovering the compressed bit map and the 
general purpose processing and control functions of the 

60 general purpose processor block for performing the remain- 
ing tasks required to convert the compressed bit map to hard 
copy. Since there are periods of time when no facsimile 
transmissions are being received, the system's full process- 
ing capability is greatly underutilized. Furthermore, the DSP 

65 functions of the modem block are dedicated to a particular 
application, in this case, facsimile reception/transmission. 
That is, as stated above, the DSP algorithm utilized to 
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recover the incoming data is fixed; aside from the ability to 
modify the coefficients of the "fax" algorithm, there is no 
flexibility in the modem algorithm to allow it to perform 
tasks other than facsimile data recovery. This results in a 
high-cost, application-specific system architecture with 5 
redundant processing capabilities. 

A variation in the Rockwell and Yamaha modem archi- 
tectures is exemplified by the OKI KV96-X6D modem chip 
set While the architecture of the OKI modem chip set 
maintains the separate modem and general purpose proces- 10 
sor functions of the Rockwell and Yamaha modems 
described above, its analog front end and DSP functions are 
also separated Since the DSP function is programmable, 
some flexibility in the type of signal that may be processed 
is permitted However, once programmed, the DSP function 15 
of the OKI modem still relies on a fixed DSP algorithm. 
Thus, the OKI architecture has the same basic limitations 
and inefficiencies as the Rockwell and Yamaha devices. 

The Texas Instruments TMS320C25 Digital Signal Pro- ^ 
cesser provides a "general purpose** DSP capability in that 
it can accommodate a number of DSP algorithmic 
sequences. However, it relies on dedicated memory for 
storage of its DSP operations and data. Thus, it must 
incorporate its own segregated control capability aside from ^ 
that provided by the general purpose processor with which 
it is associated 

NEC IC Microsystems Ltd provides a modem DSP chip 
mat includes a DSP core that is integrated with a general 
purpose processor block. However, the DSP core of the NEC ^ 
device is dedicated to a particular algorithm and relies on its 
own control functions and an internal memory separate from 
that of the general purpose processor function for" storage 
and retrieval of its operands. Furthermore, the general 
purpose processor function is fully embedded, making it 35 
unavailable for tasks other than those related to the dedi- 
cated DSP function. 

It would, therefore, be desirable to have available a dual 
processor platform that can execute a variety of DSP algo- 
rithms while maintaining full general purpose processor 40 
capability. 

SUMMARY OF THE INVENTION 

The present invention provides a data processing system 
that utilizes integrate general purpose processor (GPP) and 45 
digital signal processor (DSP) functions that are connected 
for common access to an internal shared memory array. The 
shared memory array stores the operands for a set of basic 
DSP operations that can be executed by the DSP function. 
The sequence of DSP operations to be executed by the DSP 50 
function is selectively configurable by the GPP function; mat 
is, the general purpose processor can define a variety of DSP 
algorithms that can be executed by the DSP function for 
processing different digital input signal formats. In addition 
to storing the operands required by the DSP function for 55 
execution of a DSP algorithm, the internal shared memory 
array also stores selected instructions and data required by 
the GPP function for execution of general purpose tasks. The 
operands, insructions and data may be selectively loaded to 
the internal shared memory array from system memory. 60 
After execution of a DSP algorithm, the corresponding 
information set may be down-loaded from the internal 
memory array to system memory and a new information set 
retrieved for execution of a subsequent DSP algorithm or a 
new general purpose processor task. 65 

Thus, in accordance with the principles of the present 
invention, the general purpose processor selects a DSP 
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algorithm for conditioning and recovering digital data from 
the incoming signal. That is, the GPP selects from the set of 
basic DSP operations to define a specific sequence of DSP 
operations appropriate for processing the incoming signal 
The GPP then retrieves operands required for execution of 
the selected DSP algorithm and/or instructions and data 
critical to the GPP for controlling the DSP function or for 
performing GPP tasks and loads them into the internal 
shared memory array. Next, the GPP invokes the first DSP 
operation in the selected sequence and the DSP function 
performs the DSP operation utilizing operands retrieved by 
the DSP function from both the shared memory array and 
system memory. Upon completion of the DSP operation by 
the DSP function, the GPP function either reads the result of 
the DSP operation, invokes the next DSP operation in the 
15 selected sequence or performs a GPP task. This process 
continues until the selected sequence of DSP operations has 
been executed by the DSP function. The GPP may then 
download from the internal shared memory array the 
operands, instructions and data utilized in executing the 
20 selected DSP algorithm and either identify and execute a 
subsequent DSP algorithm fashioned from the set of basic 
DSP operations or retrieve instructions and data required for 
a separate GPP task- 
While the input signal to the data processing system may 
be received directly from a digital source, a preferred 
embodiment of the invention includes an analog front end 
that converts a modulated input signal received on an analog 
channel to a corresponding digital signal for processing by 
the data processing system. 
30 Thus, a data processing system in accordance with the 
present invention provides a unique system partitioning by 
integrating a small DSP module and a general purpose 
processor. This unique partitioning provides a single pro- 
cessor solution for both DSP and general purpose compu- 
tations that can utilize the same programming mode and the 
same system development tools for both functions. The DSP 
module provides the capability necessary to handle a variety 
of DSP requirements. The internal shared memory allows 
the DSP algorithms to be tuned or changed or new algo- 
rithms to be added to meet changing, expanding system 
requirements; general purpose computation intensive tasks 
can also be executed directly from the internal shared 
memory. 

A better understanding of the features and advantages of 
the present invention may be obtained by reference to the 
following detailed description of the invention and accom- 
panying drawings which set form an illustrative embodiment 
in which the principles of the invention are utilized. 

DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a block diagram illustrating the basic functional 
partitioning of a conventional facsimile system. 

FIG. 2 is a block diagram illustrating the basic functional 
partitioning of a data processing system in accordance with 
the present invention. 

FIG. 3 is a block diagram illustrating the primary func- 
tional units of a data processing system in accordance with 
the present invention, 

FIG. 4 is a block diagram illustrating a DSP module 
6q utUizable in a data processing system in accordance with the 
present invention. 

FIG. 5 illustrates the general purpose processor address 
mapping of a data processing system in accordance with the 
present invention. 
65 FIG. 6 is a table illustrating the memory organization of 
a complex vector for use in a DSP module in accordance 
with the present invention, 



35 



40 



43 



50 



55 



ill 



5 

FIG. 7 provides an instruction set summary for a DSP 
module in accordance with the present invention, 

FIG. 8 is a table illustrating the handling of cyclic buffers 
for a DSP module in accordance with the present invention, 

FIG. 9A is a block diagram illustrating an internal bus 
configuration of a data processing system in accordance with 
the present invention with the DSP module executing a 
VCMAG command. 

FIG. 9B is a block diagram illustrating an internal bus 
configuration of a data processing system in accordance with 
the present invention with the DSP module executing a 
VCMAD, VCMUL or VCMAC command, 

FIG. 9C a block diagram illustrating an internal bus 
configuration of a data processing system in accordance with 15 
the present invention with the general purpose processor 
executing a read or write to registers of the DSP module or 
to the internal memory array. 
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DETAILED DESCRIPTION OF THE 
INVENTION 



20 



FIG. 2 shows a data processing system 10 which is 
uniquely partitioned in accordance with the concepts of the 
present invention. The data processing system 10 is 
described below in the context of the requirements of a 25 
facsimile system. However, it will be understood by those 
skilled in the art that the principles of the invention are 
applicable to any system which receives ah mcoming data 
signal that requires digital signal processing. 

The data processing system 10 shown in FIG. 2 includes 30 
' two primary functional elements: an analog front end 12 and 
an integrated processing platform 14. The integrated pro- 
cessing platform 14 includes both a digital signal processor 
(DSP) module 16 and a general purpose processor (GPP) 18. 

The analog front end 12 converts a modulated input signal 33 
received from an analog transmission channel, e.g. a tele- 
phone line, to a digitized replica of the modulated input 
signal. Hie analog front end 12 can be implemented utilizing 
conventional, off-the-shelf integrated circuit products avail- 
able for this purpose. 40 

As stated above, the integrated processor platform 14 
includes a DSP module 16 that recovers digital data from the 
digital signal generated by the analog front end 12. The DSP 
module 16 includes a processing mftrh3mlgin t described in 
greater detail below, that conditions the digital signal uti- 
lizing an algorithm comprising a selected sequence of DSP 
operations. 

The general purpose processor 18 controls the DSP mod- 
ule 16 and processes the digital data generated by the DSP ^ 
module 16 to a desired end result The general purpose 
processor 18 may be any conventional state-of-the-art 
microprocessor. 

As further shown in FIG. 2, while in many applications, 
the analog front end 12 will be utilized to convert a modu- 55 
lated input signal received on an analog channel to a 
corresponding digital signal, there are a growing number of 
applications (e.g., ISDN and Tl) in which a digital input 
signal will be received by the integrated processor platform 
14 directly from a digital source. 50 

Referring to FIG. 3, both the DSP module 16 and the 
general purpose processor 18 are connected to an internal 
bus 20. allowing both the DSP module 16 and the general 
purpose processor 18 to communicate with a system 
memory (not shown) via a conventional bus interface unit 24 65 
for transfer of control/status information and addresses/data 
therebetween. It will be understood by those skilled in the art 
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that the internal bus 20 comprises both an internal address 
bus for handling address references by the DSP module 16 
and the general purpose processor 18 and an internal data 
bus for handling instruction and data transfers. 

5 To save bus bandwidth, the DSP module 16 stores oper- 
ands used in executing DSP algorithms in an internal RAM 
memory array 22 which, as will be described in greater 
detail below, is also accessible to general purpose processor 
18. That is, in accordance with the concepts of the present 
invention, the internal memory array 22 serves as a shared 

10 resource for both the DSP module 16 and the general 
purpose processor 18. In the illustrated embodiment, the 
internal memory is shown as accessible by the DSP module 
16 and the general purpose processor 18 via the internal bus 
20. It will be understood by those skilled in the art that other 

15 bus structures would also provide the desired shared acces- 
sibility to the internal memory array 22; for example, the 
internal memory array 22 could be implemented as a dual 
port memory. 

» As described in greater detail below, the DSP module 16 
may fetch operands in parallel from the internal memory 
array 22 and system memory. 

The DSP module 16 executes vector operations on com- 
plex variables that are optimized for DSP applications. The 
25 general purpose processor 18 treats the DSP module 16 as a 
memory mapped I/O device that occupies a reserved 
memory space, interfacing with the DSP module 16 via a set 
of memory mapped registers. 

As shown in FIG. 4, high performance is achieved in the 
30 DSP module 16 by using the internal shared memory array 
22 as well as a multiplier/accumulator 26. The DSP module 
also includes its own internal address generator 28 for 
system memory and internal operand accesses, thus reduc- 
ing the load on the general purpose processor 18. Both the 
35 iruiltipUex/accumulator 26 and the address generator 28 are 
conventional implementations . 

In the operation of the data processing system 10, the 
general purpose processor 18 selects from a basic set of DSP 
operations to define a specific sequence of operations as the 
40 DSP algorithm to be executed by the DSP module 16 for 
recovering data from the incoming digital signal. The gen- 
eral purpose processor then retrieves operands required for 
execution of the selected DSP algorithm and/or instructions 
and data critical to the general purpose processor for con- 
45 filing the DSP module 16 or for performing general 
purpose tasks and loads them into the internal RAM array 
22. The general purpose processor then invokes the first DSP 
operation in the selected sequence by issuing the corre- 
sponding command to the control register of the DSP 
50 module 16. The DSP module then places the general purpose 
processor 18 in a continuous wait state while it performs the 
first DSP operation utilizing operands retrieved by the 
address generator 28 from the RAM array 22 and system 
memory. Upon completion of the DSP operation, the DSP 
55 module cancels the continuous wait state and the general 
purpose processor 18 then either reads the status of the DSP 
module 16 or the result of the DSP operation or carries on 
with the execution of its normal program flow, which may 
be either invoking the next DSP operation in the selected 
60 sequence by issuing the appropriate command to the DSP 
module control register or performance of a general purpose 
task. This process continues until the selected sequence of 
DSP operations has been completed. The general purpose 
processor may then download the contents of the shared 
65 internal RAM array 22 and retrieve a new set of operands, 
instructions and data for further DSP operations or general 
purpose processing tasks. 
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As further shown in FIG. 4, the DSP module 16 performs 
complex arithmetic calculations on two vector operands 
provided to the multiplier/acciiniiilator 26 at Port Y and Port 
D. One vector is retrieved from the internal memory array 
22. The other vector is either organized as a circular buffer 5 
in the system memory (described in greater detail below) or 
retrieved from the internal memory array 22. 

The DSP module 16 executes vector operations in a two 
stage pipeline. This allows for a significant performance 
enhancement as the fetch and execution of operands for 1Q 
consecutive vector elements are performed simultaneously 
rather than in a strictly sequential manner. Hie DSP module 
16 can fetch up to two data elements at a time, using its 
address generator 28 for system memory access and the 
internal array 22 for the second operand. While fetching 
operands for one vector element, the DSP module 16 per- 15 
forms the multiply and add operations on the previous vector 
element 

The DSP module 16 contains seven registers in addition 
to the RAM array 22. These registers, as well as the internal 
memory array 22, are accessed by the general purpose 20 
processor 18 as memory-mapped I/O devices. As shown in 
FIG. 5, their associated addresses reside in the upper part of 
a 32-bit address range of general purpose processor 18. 
External memory locations are specified by the lower 24 
address bits and mapped to the lower 16 megabyte of this 25 
address range. 



Any reference by general purpose processor 18 to the 
registers of the DSP module 16 or to the internal memory 
array 22 is done using a bus protocol for internal control 
register access to enable external observability. This proto- 40 
col is more fully described in commonly-assigned U.S. 
patent application Ser. No. 07/750,771, filed Aug. 8, 1991, 
now UJS. Pat No. 5,212,775, which is a continuation of U.S. 
patent appliction Ser. No. 07/461,023, filed Jan. 4 , 1990, by 
Zeev Bikowsky and Dan Biran, titled METHOD AND 45 
APPARATUS FOR OBSERVING MEMORY-MAPPED 
REGISTERS, now abandoned; the just-referenced 
Bikowsky/Biran application is hereby incorporated by ref- 
erence to provide additional background information regard- 
ing the present invention. 50 

Each storage location in the internal memory array 22 is 
32 bits wide and holds one complex number. 

As stated above, the internal memory array 22 is not 
limited to storage of filtering coefficients for a specific DSP 
algorithm. It can also be used as a fast, zero-wait state, 55 
integrated memory for storing instructions and data utilized 
by the general purpose processor 18 as well as for storing 
selected operands for use by the DSP module 16 for pro- 
cessing a variety of data signal formats. 

The memory array 22 can be used for instruction fetches 60 
with only one restriction: instructions must be loaded into 
the array 22 using word aligned accesses. This can be 
achieved by moving the aligned double-word from system 
memory to memory array 22. Data can also be stored in the 
memory array 22 with one restriction: storing data in the 63 
array 22 can be done only if all the data is written using 
aligned word or double-word accesses. 
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Referring back to FIG. 4, the multiplier input register Y is 
a 32-bit register that holds one complex operand. The 
multiplier input register Y is mapped into two consecutive 
words called YO and Yl. 

The accumulator register A is a 32-bit register mat holds 
one complex result The A register is mapped into consecu- 
tive words, also called AO and Al. Internally, AO and Al are 
32-bit registers. However, only bits 15-30 (i.c, 16 bits) are 
visible. The rest of the bits are used for a higher dynamic 
range and intermediate calculations. 

A 24- bit pointer to the beginning of the data vector in the 
external system memory is provided by data pointer register 

is DFTR. In order to implement circular buffers, only the less 
significant bits of the DFTR pointer are incremented. When 
the end of a buffer is reached, the least significant bits of the 
DFTR pointer are reloaded with zeroes. The number of bits 
that are set to zero, which defines the size of the circular 

20 buffer, is controlled by a Control Register CTL, which is 
described below. The least significant word of the DFTR 
pointer is called DFTRO and the most significant byte is 
called DFTR1. 

25 The CPTR register holds the address and length of the 
coefficient vector. 

I Q 5 |lq 11* |L4 |L3 |L2 I LI |L0 Po |Sols4|S4|S3|S2|Sl | SoH 

S0-S6 Start Address of coefficient's vector 
(number of C reg.) 

LO-L6 Length of coefficient' vector 
(in doable words). 



The Control Register CTL controls the various modes of 
operation of the data processing system 10. 



I bpcl Iopco 1 psi 1 dso I x I x I clr l"corl 
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OPC1-0 Operation code. 

00 VCMAD Vector Complex Multiply Add 

01 VCMUL Vector Complex MnMpry 

10 VCMAC Vector Complex Multiply Accumulate 

11 VCMAO Vector Complex Magnitude 
DS0-DS1 Data Buffer Size. 

00 8 doable-words 

01 16 double-words 
10 32 double-words 

50 11 64 double- words 

CLR Clear Accumulator (AO and Al) before 

COJ Conjugate when set to 1, the value of the 
operand in Port D of the multiplier will be 
conjugated prior to multiplication. 



The Status Register ST holds the status of the last vector 
operation. 

eo loW | x I x I x I x I x I opi I okt] 

OPO Overflow occurred on calculation of AO. 
OPI Overflow occurred on calculation of Al. 

65 The ST register is cleared to 0 in the following cases: 
the user writes directly to either AO or Al. 
the user writes to the CTL register upon reset- 
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The operation of the DSP module 16 will now be 
described in greater detail; the following terms will be used 
in the operational description: 



Qi] 


an entry in internal memory array 22, 




entry [i] can be selected by address 




generator 28 or directly accessed by 




CPU 18; 


Dli] 


Data from system memory fetched using 




address generator 28; 


Y 


Complex Multiplier input register 30 




in FIG. 4; 


DM* 


The conjugate of D[i]; 


A 


Complex Accumulator register. 
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The DSP module 16 executes the following six basic 15 
commands: 



VCMAC \fector Complex Multiply Accumulate 

VCMAG Vfector Complex Magnitude „ 

VCMAD Vector Complex Multiply Add 

VCMUL Vfector Complex Murfipry 

LOAD Write into C f Y, A or CTL 

STORE Read from C, Y, A, ST or CTL 



The VCMAC, VCMAD and VCMUL commands use the 25 
following parameters: 



jy \fector Start Address in system memory 

C \fector Start Address in internal RAM ^ 

\ector Length 

Control bits 



The VCMAG command uses only the last three operands. 

Complex numbers are organized in the internal memory 35 
array 22 as double words. Each double word contains two 
16-bit 2's complement fractional integers. Hie less signifi- 
cant word contains the Real part of the number. The most 
significant word contains the Imaginary part of the number. 

The complex vectors utilized by the DSP module 16 
consist of arrays of complex numbers stored in consecutive 40 
addresses. Complex vectors must be aligned to double word 
boundary. FIG. 6 illustrates the memory organization of a 
vector D. 

Referring back to FIG. 4, the arithmetic logic unit 26 of 
the DSP module 16 contains a 16x16 multiplier 26a and a 45 
32-bit adder/accumulator 262>. Bits 15-30 (16 bits) of the 
result are rounded and can be read by accessing the A 
register. If an overflow is detected during an operation, the 
Status Register (ST) overflow bit and either the OPO bit or 
the OP1 bit is set to "1.** 50 

When data is loaded into the adder/accumulator 266, the 
16 bits of data are loaded into bits 15-30, the lower bits are 
set to *XT, while bit 31 gets the same value as bit 30 (sign 
extended). An overflow is detected whenever the value of bit 
30 is different from the value of bit 31. 

Each basic DSP operation or instruction to be perforated 
by the DSP module 16 is controlled by two OP-code bits 
(OPC0 and OPC1) and two specifiers (COJ and CLR). COJ 
specifies whether the operand on port D of the multiplier 26a 
must be conjugated prior to multiplication. The CLR bit is 
used to extend the instruction set. On VCMAC and 60 
VCMAG, CLR specifies whether the accumulator 26b must 
be cleared at the beginning of the vector operation. On 
VCMAD, CLR specifies that the operation will ignore the 
value of C[i]. In VCMUL, CTit indicates that the value of 
D[i] is to be taken instead of 1+D[i]. 65 

FIG. 7 provides a summary of the set of basic DSP 
operations executed by the DSP module 16 as a function of 
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the OPC1, OPC0, COJ, and CLR hits in the CTL register. In 
FIG. 7, "SIGMA" represents the summation sign: 



M 
£ 

j=l 



All the operands are complex numbers. Thus, A=SIGMA 
C[i]xD[i] breaks down to: 

10 

The accumulator 26b* the multiplier input register Y, the 

15 external data pointer DFTR and the coefficient pointer 
CFTR registers are used as temporary registers during vector 
operations. Values stored in these register prior to activation 
of the DSP module 16 are destroyed. If the content of the 
accumulator register A after an operation of the DSP module 

20 16 is used as an initial value for the next operation, it must 
be remembered that the least significant bits of (0/14) may 
contain a value of other than zero. 

As stated above, the DSP module 16 accesses arrays of 
data in external memory using the DFTR pointer as an 

25 address. The DSO and DS1 bits of the CTL register control 
the size of the array. The DSP module 16 allows a conve- 
nient way of handling data arrays as a FIFO. Onl y fee 
appropriate number of the least significant bits of the DFTR 
are incremented on each access. The upper bits remain 

30 constant FIG. 8 shows which bits are incremented. The rest 
remain constant. 

FIG. 9A illustrates the operation of the data processing 
system 10 with the DSP module 16 executing the VCMAG 
command while the general purpose processor 18 executes 

35 a general purpose task. 

As shown in FIG. 9 A, bidirectional switches S that are 
responsive to control signals provided by the general pur- 
pose processor 18 are located on the internal bus 20 so as to 
permit configuration of a variety of c ommuni c ati ons paths 

40 among the DSP module 16, the general purpose processor 18 
and the bus interface unit 24 (which, as stated above, 
provides access to external memory). 

When the DSP module 16 is executing the VCMAG 
command, the DSP module 16 is isolated from the internal 

45 bus 20 so mat the address generator 28 can retrieve operands 
for the VCMAG operation from the internal memory array 
22 for both port Y and port D of the inultipaei/accumulator 
26. Isolation of the DSP module 16 in this manner allows the 
general purpose processor 18 to reference the external 

50 memory via the bus interface unit 24 to allow transfer of data 
and instructions between the general purpose processor 18 
and external memory for simultaneous execution of a gen- 
eral purpose task. 

FIG. 9B illustrates the bus configuration during execution 

55 of the VCMAD. VCMUL or VCMAC commands by the 
DSP module 16. In this case, the address generator 28 
references an operand stored in the memory array 22 which 
is then provided to the Y port of the multipUer/accumulator 
26. The address generator 28 also references a location in 

60 external memory which provides the second operand to the 
D port of the multiplier/accumulator 26. The general pur- 
pose processor 18 is isolated from both the internal memory 
array 22 and external memory. 

FIG. 9C illustrates read and write operations by the 

65 general purpose processor 18 either to the Y register or the 
accumulator register A or to the internal memory array 22 of 
the DSP module. As shown in FIG. 9C, in this case, the 



